NSF PAR Search | NSF Public Access Repository

Tractable MCMC for Private Learning with Pure and Gaussian Differential Privacy

Lin, Yingyu; Ma, Yi-An; Wang, Yu-Xiang; Redberg, Rachel; Bu, Zhiqi (May 2024, ICLR 2024)

Posterior sampling, i.e., exponential mechanism to sample from the posterior distribution, provides ε-pure differential privacy (DP) guarantees and does not suffer from potentially unbounded privacy breach introduced by (ε,δ)-approximate DP. In practice, however, one needs to apply approximate sampling methods such as Markov chain Monte Carlo (MCMC), thus re-introducing the unappealing δ-approximation error into the privacy guarantees. To bridge this gap, we propose the Approximate SAmple Perturbation (abbr. ASAP) algorithm which perturbs an MCMC sample with noise proportional to its Wasserstein-infinity (W∞) distance from a reference distribution that satisfies pure DP or pure Gaussian DP (i.e., δ=0). We then leverage a Metropolis-Hastings algorithm to generate the sample and prove that the algorithm converges in W∞ distance. We show that by combining our new techniques with a localization step, we obtain the first nearly linear-time algorithm that achieves the optimal rates in the DP-ERM problem with strongly convex and smooth losses.
Full Text Available
Tractable MCMC for Private Learning with Pure and Gaussian Differential Privacy

Lin, Yingyu; Ma, Yian; Wang, Yu-Xiang; Redberg, Rachel E; Bu, Zhiqi (January 2024, The Twelfth International Conference on Learning Representations)

Full Text Available
Characterizing the SLOPE trade-off: A variational perspective and the Donoho–Tanner limit

https://doi.org/10.1214/22-AOS2194

Bu, Zhiqi; Klusowski, Jason M.; Rush, Cynthia; Su, Weijie J. (February 2023, The Annals of Statistics)

Full Text Available
Algorithmic Analysis and Statistical Estimation of SLOPE via Approximate Message Passing

https://doi.org/10.1109/TIT.2020.3025272

Bu, Zhiqi; Klusowski, Jason M.; Rush, Cynthia; Su, Weijie J. (January 2021, IEEE Transactions on Information Theory)
Full Text Available
Deep Learning with Gaussian Differential Privacy

https://doi.org/10.1162/99608f92.cfc5dd25

Bu, Zhiqi; Dong, Jinshuo; Long, Qi; Su, Weijie (January 2020, Harvard Data Science Review)

Deep learning models are often trained on datasets that contain sensitive information such as individuals' shopping transactions, personal contacts, and medical records. An increasingly important line of work therefore has sought to train neural networks subject to privacy constraints that are specified by differential privacy or its divergence-based relaxations. These privacy definitions, however, have weaknesses in handling certain important primitives (composition and subsampling), thereby giving loose or complicated privacy analyses of training neural networks. In this paper, we consider a recently proposed privacy definition termed f-differential privacy [18] for a refined privacy analysis of training neural networks. Leveraging the appealing properties of f-differential privacy in handling composition and subsampling, this paper derives analytically tractable expressions for the privacy guarantees of both stochastic gradient descent and Adam used in training deep neural networks, without the need of developing sophisticated techniques as [3] did. Our results demonstrate that the f-differential privacy framework allows for a new privacy analysis that improves on the prior analysis [3], which in turn suggests tuning certain parameters of neural networks for a better prediction accuracy without violating the privacy budget. These theoretically derived improvements are confirmed by our experiments in a range of tasks in image classification, text classification, and recommender systems. Python code to calculate the privacy cost for these experiments is publicly available in the TensorFlow Privacy library.
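As a rough illustration of the kind of privacy-cost calculation this abstract refers to, the sketch below uses the central-limit-theorem approximation for noisy SGD with Poisson subsampling, μ ≈ p·sqrt(T·(exp(1/σ²) − 1)), together with the standard conversion from μ-Gaussian DP to (ε, δ)-DP. It is not the TensorFlow Privacy code: the function names and the example hyperparameters are assumptions chosen only for illustration, and the approximation is meant for many steps with a small sampling rate.

```python
import math


def std_normal_cdf(x: float) -> float:
    """Standard normal CDF, written with math.erf to stay stdlib-only."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def sgd_gdp_mu(n: int, batch_size: int, noise_multiplier: float, epochs: float) -> float:
    """Approximate GDP parameter mu for noisy SGD (hypothetical helper name).

    Assumes Poisson subsampling with rate p = batch_size / n and
    T = epochs * n / batch_size steps; returns p * sqrt(T * (exp(1/sigma^2) - 1)).
    """
    p = batch_size / n
    steps = epochs * n / batch_size
    return p * math.sqrt(steps * (math.exp(noise_multiplier ** -2) - 1.0))


def delta_from_mu(mu: float, eps: float) -> float:
    """delta such that a mu-GDP mechanism is (eps, delta)-DP (hypothetical helper name)."""
    return (std_normal_cdf(-eps / mu + mu / 2.0)
            - math.exp(eps) * std_normal_cdf(-eps / mu - mu / 2.0))


if __name__ == "__main__":
    # Illustrative settings only; they are not taken from the paper's experiments.
    mu = sgd_gdp_mu(n=60000, batch_size=256, noise_multiplier=1.1, epochs=15)
    print("mu =", mu)
    print("delta at eps = 1:", delta_from_mu(mu, 1.0))
```

Under this approximation, μ grows with the square root of the number of epochs and shrinks as the noise multiplier increases, so a single number summarizes the whole training run.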
Full Text Available
Algorithmic Analysis and Statistical Estimation of SLOPE via Approximate Message Passing

Bu, Zhiqi; Klusowski, Jason M; Rush, Cynthia; Su, Weijie (January 2019, Advances in Neural Information Processing Systems)

Full Text Available
